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FOREWORD 


This little book is devoted to three theorems in arithmetic, 
which, in spite of their apparent simplicity, have been the objects 
of the efforts of many important mathematical scholars. The proofs 
which are presented here make use of completely elementary means, 
(although they are not very simple). 


The book can be understood by beginning college students, 
and is intended for wide circles of lovers of mathematics. 
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A LETTER TO THE FRONT 
(IN LIEU OF A PREFACE) 


March 24, 1945 
Dear Seryozha, 


Your letter sent from the hospital gave me threefold pleasure. 
First of all, your request that I send you ‘‘some little mathemati- 
cal pearls" showed that you are really getting well, and are not 
merely trying, as a brave fighter, to relieve your friends' anxiety. 
That was my first pleasure. 


Furthermore, you gave.me occasion to reflect on how it is 
that in this war such young fighters as you happen to pursue their 
favorite occupation—the occupation which they cherished already 
before the war, and from which the war has tom them—so pas- 
sionately during every little respite. There was nothing like this 
during the last World War. In those days a young man who had ar- 
rived at the ffront almost invariably felt that his life had been dis- 
rupted, that what had been the substance of his life before had 
become for him an unrealizable legend. Now, however, there are 
some who write dissertations in the intervals between battles, and 
defend them on their return during a brief furlough! Is it not be- 
cause you feel with your whole being, that your accomplishments 
in war and in your favorite occupations—science, art, practical ac- 
tivity—are two links of one and the same great cause? And if so, is 
not this feeling, perhaps, one of the mainsprings of your victories 
which we, here at home, are so enthusiastic about? This thought 
gratified me very much, and that was my second pleasure. 


And so I began to think about what to send you. I do not know 
you very well—you attended my lectures for only one year. Never- 
theless I retained a firm conviction of your profound and serious 
attitude toward science, and therefore I did not want to send you 
merely some trinkets which were showy but of little substance sci- 
entifically. On the other hand I knew that your preparation was not 
very great—you spent only one year in the university classroom, 
and during three years of uninterrupted service at the Front you 
will hardly have had time to study. After several days' delibera- 


10 


tion I have made a choice. You must judge for yourself whether it 
is a happy one or not. Personally [ consider the three theorems of 
arithmetic which I am sending you, to be genuine pearls of our 
science. 


From time to time, remarkable, curious problems turn up in a- 
rithmetic, this oldest, but forever youthful, branch of mathematics. 
In content they are so elementary that any schoolboy can under- 
stand them. They are usually concerned with the proof of some 
very simple law governing the world of numbers, a law which turns 
out to be correct in all tested special cases. The problem now is to 
prove that it is in fact always correct. And yet, in spite of the ap- 
parent simplicity of the problem, its solution resists, for years and 
sometimes centuries, the efforts of the most important scholars of 
the age. You must admit that this is extraordinarily tempting. 


[ have selected three such problems for you. They have all 
been solved quite recently, and there are two remarkable common 
features in their history. First, all three problems have been solved 
by the most elementary arithmetical methods (do not, however, con- 
fuse elementary with simple; as you will see, the solutions of all 
three problems are not very simple, and it will require not a little 
effort on your part to understand them well and assimilate them). 
Secondly, all three problems have been solved by very young, be- 
ginning mathematicians, youths of hardly your age, after a series of 
unsuccessful attacks on the part of ‘‘venerable’”’ scholars. Isn't 
this a spur full of promise for future scholars like you? What an 
encouraging call to scientific daring! 


The work of expounding these theorems compelled me to pene- 
trate more deeply into the structure of their magnificent proofs, and 
gave me great pleasure. 


That was my third pleasure. 


I wish you the best of success—in combat and in science. 


Yours, 


A. Khinchin 


CHAPTER I 


VAN DER WAERDEN'S THEOREM ON 
ARITHMETIC PROGRESSIONS 
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In the summer of 1928 I spent several weeks in Göttingen. As 
usual, many foreign scholars had arrived there for the summer se- 
mester. I got to know many of them, and actually made friends with 
some. At the time of my arrival, the topic of the day was the bril- 
liant result of a young Hollander, van der Waerden? who at that time 
was still a youthful beginner, but is now a well-known scholar. 
This result had just been obtained here in Góttingen, and in fact, 
only a few days before my arrival. Nearly all mathematicians whom 
I met told me about it with enthusiasm. 


The problem had the following history. One of the mathemati- 
cians there (I forget his name**) had come upon the following prob- 
lem in the course of his scientific work: Imagine the set of all nat- 
ural numbers to be divided in any manner whatsoever into two parts 
(e. g., into even and odd numbers, or into prime and composite num- 
bers, or in any other way). Can one then assert that arithmetic pro- 
gressions of arbitrary length can be found in at least one of these 
parts? (By the length of an arithmetic progression I mean here, and 
in what follows, simply the number of its terms.) All to whom this 
question was put regarded the problem at first sight as quite simple; 
its solution in the affirmative appeared to be almost self-evident. 
The first attempts to solve it, however, led to nought. And as the 
mathematicians of Gottingen and their foreign guests were by tradi- 
tion in constant association with one another, this problem, provok- 
ing in its resistance, soon became the object of general mathemat- 
ical interest. Everyone took it up, from the venerable scholar to the 
young student. After several weeks of strenuous exertions, the 
problem finally yielded to the attack of a young man who had come 
to Göttingen to study, the Hollander, van der Waerden: I made his 
acquaintance, and learned the solution of the problem from him per- 
sonally. It was elementary, but not simple by any means. The prob- 
lem turned out to be deep, the appearance of simplicity was decep- 
tive. 

*B. L. van der Waerden, Beweis einer Baudetschen Vermutung, Nieuw Arch. 


Wiskunde 15, 212-216 (1927). (Trans.) 
**Most probably Baudet; cf. the preceding footnote. (Trans.) 
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Quite recently, M. A. Lukomskaya (of Minsk) discovered and 
sent me a considerably simpler and more transparent proof of van 
der Waerden's theorem, which, with her kind permission, I am going 
to show you in what follows. 


$2 


Actually, van der Waerden proved somewhat more than what was 
required. In the first place, he assumes that the natural numbers are 
divided, not into two, but into arbitrarily many, say k, classes 
(sets). In the second place, it turns out that it is not necessary 
to decompose the entire sequence of natural numbers in order to 

arantee the existence of an arithmetic progression of prescribed 
(arbitrarily large) length / in at least one of these classes; a cer- 
tain segment of it suffices for this purpose. The length, n (k, l), 
of this segment is a function of the numbers & and l. Of course it 
doesn't matter where we take this segment, so long as there are 
n(k,l) successive natural numbers. 

Accordingly, van der Waerden's theorem can be formulated as 
follows: 

Let k and l be two arbitrary natural numbers. Then there exists 
a natural number n (k, I) such that, if an arbitrary segment, of length 
n(k, I), of the sequence of natural numbers is divided in any manner 
into k classes (some of which may be empty), then an arithmetic 
progres sion of length l appears in at least one of these classes. 

This theorem is true trivially for /=2. To see this, it suffices 
to set n(k, 2) - k «l1; for if k+l numbers are divided into k classes, 
then certainly at least one of these classes contains more than one 
number, and an arbitrary pair of numbers forms an arithmetic pro- 
gression of length 2, which proves the theorem. We shall prove the 
theorem by induction on /. Consequently, we shall assume through- 
out the following that the theorem has already been verified for 
some number /22 and for arbitrary values of k, and shall show that 
it retains its validity for the number /+1 (and naturally also for all 
values of Kk). 
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According to our assumption, then, for every natural number & 
there is a natural number n (k, l) such that, if an arbitrary segment, 
of length n(k, l), of the natural numbers, is divided in any manner 
into k classes, there exists in at least one of these classes an 
arithmetic progression of length l. We must then prove that, for 
every natural number k, an n(k,l+1) also exists. We solve this 
problem by actually constructing the number n(&, /« 1). To this end 
we set 
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qo=l, no=nlk, dD 


and then define the numbers q;, qo, ..., 24, No, ... successively as 
follows: If 4,., and n, , have already been defined for some s>0, 
we put 


(1) q,-2n,,. d, p n7 kD (sel, 2,... 


The numbers n., q, are obviously defined hereby for an arbitrary 
s20. We now assert that for n(k, 14 1) we may take the number q,. 
We have to show then that if a segment, of length q,, of the se- 
quence of natural numbers is divided in any manner into k classes, 
then there is an arithmetic progression of length /+1 in at least one 
of these classes. The remainder of the chapter is devoted to this 
proof. 
In the sequel we set / «1 -/* for brevity. 
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Suppose then that the segment A, of length q}, of the sequence 
of natural numbers is divided in an arbitrary way into k classes. We 
say that two numbers a and b of A are of the same type, if a and b 
belong to the same class, and we then write a=b. Two equally long 
subsegments of A, 9«(a,a«1,...,a*r) and 6/^-(a4,a^«1,...,a^«r), 
are said to be of the same type, if asa, a+l=a’+l, ..., a«rea'ir, 
and we then write ô=’. The number of different possible types for 
the numbers of the segment A is obviously equal to k. For segments 
of the form (a, a « 1) (i. e., for segments of length 2) the number of 
possible types is k^; and in general, for segments of length m, it is 
k^". (Of course not all these types need actually appear in the seg- 
ment A.) 

Since q, -2n, 9, (see (1)), the segment A can be regarded as 
a sequence of 2n, , subsegments of length q, ,. Such subsegments, 
as we have just seen, can have k^*-1 different types. The left half 
of the segment A now contains n, , such subsegments, where 
ny, ,"n(k**71, 1) according to (1). Because of the meaning of the 
number n(k^*-1, I), we can assert* that the left half of the segment 
A contains an arithmetic progression of l of these subsegments of 


*Work with the initial numbers of the Ap. subsegments. (Trans.) 
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the same type, 
Ay Ao [EXE] A; 


of length 4,.,; here we say for brevity that equally long segments 
A, form an arithmetic progression, if their initial numbers form 
such a progression. We call the difference between the initial num- 
bers of two neighboring segments of the progression ^1, Ag, ..., A, 
the difference d, of this progression. Naturally the difference be- 
tween the second (or third, fourth, etc.) numbers of two such neigh- 
boring segments is likewise equal to d;. 

To this progression of segments we now add the succeeding, 
(Le 1)-st, term A, ^ (we recall that 1/2/41) which may already pro- 
ject beyond the boundary of the left half of the segment A, but 
which in any case still belongs entirely to the segment A. The seg- 
ments A4,À5,..., Ap, At^ then form an arithmetic progression, of 
length /’=1+1 and difference di, of segments of length q, ,, where 
A1, Ao; +.. A, are of the same type. We know nothing about the type 
of the last segment A, ^ 

This completes the first step of our construction. It would be 
well if you thought it through once more before we continued. 
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We now proceed to the second step. We take an arbitrary one of 
the first l terms of the progression of segments just constructed. 
Let this term be Ai so that 1 <¿:<l; Aa is a segment of length 
q,.j. We treat it the same way as we treated the segment A. Since 
9,21 72n,.54,.2; the left half of the segment A. can be regarded 
as a sequence of n, , subsegments of length q, ,. For subsegments 
of this length there are ka types possible, and on the other hand 
nj. =n(k*-2, I) because of (1). Therefore the left half of A; , must 
contain a progression of l of these subsegments of the same type, 
A, i, 0 &i2$D, of length q.p. Let d2 be the difference of this pro- 
gression (i.e., the distance between the initial numbers of two 
neighboring segments). To this progression of segments we add the 
(J+ 1)-st term A;a about whose type, of course, we know nothing. 
The segment Aa’ does not have to belong to the left half of the 
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segment Ai, any more, but must obviously belong to the segment 
Ai. 

We now carry over our construction, which we have executed 
up to now in only one of the segments A; x congruently to all the 
other segments A;, (1gi:¢2). We thus obtain a set of segments 
Aii (l<t,<l4 1«i2&l^ with two indices. It is clear that two ar- 
bitrary segments of this set with indices not exceeding l are of the 
same type: 

You no doubt see now that this process can be continued. We 
carry it out k times. The results ofour construction after the first 
step were segments of length q, ;, after the second step, segments 
of length q, », etc. After the k-th step, therefore, the results of the 
construction are segments of length go=1, i.e., simply numbers of 
our original segment A. Nevertheless we denote them as before by 


baise (lta, L5, EE il^). 


For 1$s$ k andl&i: ... i, ii, woot, Sl we have 
4 
(2) B, ioei, =A 
We now make two remarks which are important for what follows. 
1) In (2), if s«k and if i143 erst, are arbitrary indices 
taken from the sequence l, 2, ..., i, ¢4 then the number 


appears in the same position in the segment 
does in the segment 


^ 


7.7 . 
Liloeceky 


* 2 * acd .9. . 
iji, 88 the number DIN ET 


A; ^. Since these two segments are o the same type because of 


1 
(2), it follows that 


(3) Nig ige ig pee EES Repo 
if lSis -sip ig. tf Sl and 13i, piggy i, SU (gs. 
2) For sgk and t[=i, +1, Asus dit and A eus are ob- 


viously neighboring segments in the s-th step of our construction. 

Therefore for arbitrary indices ,,,,..., i> the numbers 

a PAIS F : ear in the same 
Oe hes oe Oe hee Ss hed 2 app 


position in two such neighboring segments, so that (with i/=i,,,) 


and A 
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(4) -d. 


Tar 7, "E MET : 
beselo ly Is ixl 1145.1 b bog & sS 
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Now we are near our goal. We consider the following k+l num- 
bers of the segment A: 


(5) a27 11 77...’ 


e 8s 9 e © 


Since the segment A has been divided into k classes, and we have 
k+1 numbers in (5), there are two of these numbers which belong 
to the same class. Let these be the numbers a, and a, (r«s), so 


that 
(6) Ay VAL 
r k-r s k-s 


We consider the ¿+ 1 numbers 


(7) G=A) disi... Asist). 


— a ~ 
r s-r k-s 


The first / numbers of this group (i. e., those with i<} belong to 
the same class because of (3). The last ({=14, however, is of the 
same type as the first because of (6). Consequently all ¿+1 num- 
bers in (7) are of the same type, and to prove our assertion we have 
only to show that these numbers form an arithmetic progression, 


i.e., that the difference c;,,—c, (1xi&/) does not depend on i. 
We set 1+1=2’ for brevity. Further let 
C; m= M iti itl’.d Oms), 


r m s-r-m kes 


"T -c .,and hence 
i+l 
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-r 
en-el —6. 4). 
i+] i ml i,m i,m-1 


Because of (4) we have 


Gm 7 C m-17 AY SRP oe AE ye A uad rasta TL 
S- — a ee eee” Sa a aa M ————" ~ — 
r m s-r-m kes r m-l s-r-m-«l k-s 


Thus the difference 


CTG E= dya do te td, 


and is indeed independent of i, which completes the proof of our 
assertion. 


You see how complicated a completely elementary construc- 
tion can sometimes be. And yet this is not an extreme case: in the 
next chapter you will encounter just as elementary a construction 
which is considerably more complicated. Besides, it is not out of 
the question that van der Waerden’s theorem admits of an even 


simpler proof, and all research in this direction can only be wel- 
comed. 


CHAPTER II 


THE LANDAU-SCHNIRELMANN HYPOTHESIS 
AND MANN'S THEOREM 
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You have perhaps heard of the remarkable theorem of 
Lagrange, that every natural number is the sum of at most 
four squares. In other words, every natural number is either 
itself the square of another number, or else the sum of two, 
or else of three, or else of four such squares. For the purpose 
at hand it is desirable to understand the content of this theorem 
in a somewhat different form. Let us write down the sequence of 
all perfect squares, beginning with zero: 


(S) 0, 1, 4, 9, 16, 25, ... . 
This is a certain sequence of whole numbers. We denote it by S, 
and imagine four completely identical copies of it, ZU Sy y Syp 
to be written down. x we choose an arbitrary number a. from S T 
an arbitrary number P from S5, an arbitrary number a2 fom S, 3, aad 
an arbitrary number a2 from S,, and add these SER together. 
The resulting sum 


(*) n-a, ata, +a, 
can be 

1) zero (if a, =a, =a, =a, =0); 

2) the square of a EM number (if in some representation (*) 
of the number n three of the numbers a), a,, a,, a, are zero and 
the fourth is not zero); 

3) the sum of two squares of natural numbers (if in some rep- 
resentation (*) of the number n two of the numbers a,, 89, da, G4 
are zero and the other two are not zero); 

4) the sum of three squares of natural numbers (if in some rep- 
resentation (*) of the number n one of the numbers a,, ao, 84, a, 
is equal to zero and the remaining three are not zero); 
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5) the sum of four squares of natural numbers (if in some rep- 
resentation of the number n all four numbers are different from zero). 

Thus the resulting number n is either zero or else a natural 
number which can be represented as the sum of at most four 
squares, and it is clear that conversely every natural number can be 
obtained by the process which we have described. 

Now let us arrange all natural numbers n which can be obtained 
by means of our process (i.e., by the addition of four numbers taken 
respectively from the sequences S}, S,, Sa; S,), in order of mag- 
nitude, in the sequence 


(A) 0, np Ry Mas oe 


(where O0<n, «n, <n <... so that if there are equal numbers among 
those constructed, only one of them appears in (A)). The theorem 
of Lagrange now asserts simply that the sequence (A) contains all 
the natural numbers, i. e., that n, - 1, n, =2, n4 23, etc. 

We shall now generalize our process. Let there be given k 
monotonically increasing sequences of integers which begin with 


zero: 
(409) 0 a) (1) a! 1) 
9 1 > > e» m > , 
(2 (2 (2 (2) 
(A 0,2/,2,, ee ET 
(45 0, a a), S a’ *) ae 
1 2 m 


We choose arbitrarily a single number from each sequence A“) 
(lgi<k) and add these k numbers together. The totality of all num- 
bers constructed in this manner, if we order them according to mag- 
nitude, yields a new sequence 


(A) 0, Ris Mos eo Rg ee 
of the same type, which we shall call the sum of the given se- 
quences A(D, AC), ..., AQ0. 
k o; 
ASAA nA eA N 


t=] 
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The content of Lagrange’s theorem is that the sum S4 54 S4 S con- 
tains the entire sequence of natural numbers. 

Perhaps you have heard of the famous theorem of Fermat, that 
the sum S+S contains all prime numbers which leave a remainder 
of 1 when divided by 4 (i.e., the numbers 5, 13, 17, 29, ...). Per- 
haps you also know that the famous Soviet scholar Ivan Matveye- 
vitch Vinogradov proved the following remarkable theorem, on which 
many of the greatest mathematicians of the preceding two centuries 
had worked without success: 

If we denote by P the sequence 


(P) 0, 2, 3, 5, 7, 11, 13, 17, ... 


consisting of zero and all prime numbers, then the sum P «P «P 
contains all sufficiently large odd numbers. 

I have cited all these examples here for only one very modest 
purpose: to familiarize you with the concept of the sum of se- 
quences of numbers and to show how some classical theorems of 
number theory can be formulated simply and conveniently with the 
aid of this concept. 
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As you have undoubtedly observed, in all the examples mention- 
ed we are concerned with showing that the sum of a certain number 
of sequences represents a sequence which contains either com- 
pletely or almost completely this or that class of numbers (e. g., 
all the natural numbers, all sufficiently large odd numbers, and 
others of the same sort). In all other similar problems the purpose 
of the investigation is to prove that the sum of the given se- 
quences of numbers represents a set of numbers which is in some 


” in the sequence of natural numbers. It is often the 


sense ''dense 
case that this set contains the entire sequence of natural numbers 
(as we saw in our first example). The theorem of Lagrange says that 
the sum of the four sequences S contains the whole sequence of nat- 
ural numbers. Now it is customary to call the sequence A a basis 
(of the sequence of natural numbers) of order k if the sum of k iden- 
tical sequences A contains all the natural numbers. The theorem of 


Lagrange then states that the sequence S of perfect squares is a 
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basis of order four. It was shown later that the sequence of perfect 
cubes forms a basis of order nine. A little reflection shows that 
every basis of order k is also a basis of order k +1. 

In all these and in many other examples the ‘‘density’’ of the 
sum which is to be established is determined by particular prop- 
erties of the sequences that are added together, i.e., by the spe- 
cial arithmetical nature of the numbers which go to make up these 
sequences (these numbers being either perfect squares, or primes, 
or others of a similar nature). Sixteen years ago the distinguished 
Soviet scholar Lev Genrichovitch Schnirelmann first raised the 
qnestion: To what extent is the density of the sum of several se- 
quences determined solely by the density of the summands, irre- 
spective of their arithmetical nature. This problem turned out to be 
not only deep and interesting, but also useful for the treatment of 
some classical problems. During the intervening fifteen years it 
received the attention of many outstanding scholars, and it has 
given rise to a rich literature. 

Before we can state problems in this field precisely and write 
the word "density" without quotation marks, it is evident that we 
must first agree on what number (or on what numbers) to use to 
measure the “‘density’’ of our sequences with (just as in physics 
the words ‘‘warm’’ and ''cold" do not acquire a precise scientific 
meaning until we have learned to measure temperature). 

A very convenient measure of the “‘density’’ of a sequence of 
numbers, which is now used for all scientific problems of the kind 
we are considering, was proposed by L. G. Schnirelmann. Let 


(A) 0, aa 


..., Q eee 
2’ : Tn? 


be a sequence of numbers, where, as usual, all the a, are natural 
numbers and a, «a, , , (n=1,2,...). We denote by A(n) the number of 
natural numbers in the sequence (A) which do not exceed n (zero is 
not counted), so that O<A(n)<n. Then the inequality 


0<Aln)<} 
n 


holds. The fraction A(n)/n, which for different n naturally has dif- 
ferent values, can obviously be interpreted as a kind of average 
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density of the sequence (A) in the segment from l to n of the se- 
quence of natural numbers. Following the suggestion of Schnirel- 
mann, the greatest lower bound of all values of this fraction is 
called the density of the sequence (A) (in the entire sequence of 
natural numbers). We shall denote this density by d(A). 

In order to become familiar at once with the elementary prop- 
erties of this concept, I recommend that you convince yourself of 
the validity of the following theorems: 

l. If a, >1 (i. e., the sequence (A) does not contain unity), then 
d(A) =0. 

2. If a, 214 r(n- 1) (i.e., the sequence (A), beginning with a}, 
is an arithmetic progression with initial term l and difference r), 
then d(4) - l/r. 

3. The density of every geometric progression is equal to zero. 

4. The density of the sequence of perfect squares is equal to 
Zero. 

5. For the sequence (A) to contain the entire sequence of nat- 
ural numbers (a, =n, n=l, 2, ...), it is necessary and sufficient that 
d(A) - 1. 

6. If (4) -0 and A contains the number l, and if «^O is arbi- 
trary, then there exists a sufficiently large number m such that 
A(m) « em. 

If you have proved all this, you are familiar enough with the 
concept of density to be able to use it. Now I want to acquaint you 
with the proof of the following remarkable, albeit very simple, lem- 
ma of Schnirelmann: 


(1) d(A + B) > d( 4) + d(B) - d(A) d(B). 


The meaning of this inequality is clear: the density of the sum of 
two arbitrary sequences of numbers is not smaller than the sum of 
their densities diminished by the product of these densities. This 
**Schnirelmann inequality'' represents the first tool, still crude to 
be sure, for estimating the density of a sum from the densities of 
the summands. Here is its proof. We denote by A(n) the number of 
natural numbers which appear in the sequence A and do not exceed 
n, and by R(n) the analogous number for the sequence B. For brevi- 
ty we set d(4) «a, d(B) - B, A+B=C, d(C) «y. The segment (1,7) of 
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the sequence of natural numbers contains A(n) numbers of the se- 
quence A, each of which also appears in the sequence C. Let a, 
and a,,, be two consecutive numbers of this group. Between them 
there are a,,, a, - 1-1 numbers which do not belong to A. These 
are the numbers 


a, * 1, a, *2, ..., a, +l=a,,,-1. 


Some of them appear in C, e.g., all numbers of the form a,+r, 


where r occurs in B (which we abbreviate as follows: r €B). There 
are as many numbers of this last kind, however, as there are num- 
bers of B in the segment (1,7), that is, B(J) of them. Consequently 
every segment of length / included between two consecutive num- 
bers of the sequence A contains at least B(/) numbers which belong 
to C. It follows that the number, C(n), of numbers of the segment 
(1,n) appearing in C is at least 


A(n) +2 B(I) 


where the summation is extended over all segments which are free 
of the numbers appearing in A. According to the definition of den- 
sity, however, B(/)2 Bl, so that 


C(n) 2 A(n) + BEI =A(n)+ Bin-A(n)}, 


because 24 is the sum of the lengths of all the segments which are 
free of the numbers appearing in A, which is simply the number 
n—A(n) of numbers of the segment (1,7) which do not occur in A. 
But A(n)zan, and hence 


C(n) 24()1 -B) + Bn zan(1-B)« Bn, 
which yields 
C(n)/n > a«- B -aB. 


Since this inequality holds for an arbitrary natural number n, we 
have 


y=d(C)za+ß —-af, Q. E. D. 


Schnirelmann's inequality (1) can be written in the equivalent 
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form 


1-d(A +B) <{1-d(A)}{1-d(B)}, 


and in this form can easily be generalized to the case of an arbi- 


trary number of summands: 


k 
1-d(Ai «4, «...«4,)& I 11- d(4)). 


It is proved by a simple induction; you should have no trouble in 
carrying it out yourself. If we write the last inequality in the form 


k 
(2) d(A +A,+...+A,)2 l- II 11-2(4)], 


it again enables one to estimate the density of a sum from the den- 
sities of the summands. L. G. Schnirelmann derived a series of very 
remarkable results from his elementary inequality, and obtained 
above all the following important theorem: 

Every sequence of positive density is a basis of the sequence 
of natural numbers. 

In other words, if a=d(A)>0, then the sum of a sufficiently 
large number of sequences A contains the entire sequence of nat- 
ural numbers. The proof of this theorem is so simple that I should 
like to tell you about it, even though this will divert us a bit from 
our immediate problem. 

Let us denote for brevity by A, the sum of k sequences, each 
of which coincides with A. Then by virtue of inequality (2), 


d(4,)21-(1-a)*. 
Since a0, we have, for sufficiently large k, 
(3) d(A,)>%. 
Now one can easily show that the sequence A,, contains the 
whole sequence of natural numbers. This is a simple consequence 


of the following general proposition. 


LEMMA. If A(n)+B(n)>n-1, then n occurs in A+B, 


Indeed, if n appears in A or in B, everything is proved. We may 
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therefore assume that n occurs in neither A nor B. Then A(n)= 


A(n -1) and B(n)= B(n-1), and consequently 
A(n-1) - B(n-1)»n- 1. 


Now let a,,a5,...,a, and b, b,,..., b, be the numbers of the 
segment (1,n—1) which appear in A and B, respectively, so that 
r-Á(n-1) s-B(n-1). Then all the numbers 

ajs Ay, --- Op, 


n—b,, n—b,, ..., n—b. 


belong to the segment (1, n—1). There are r+s=A(n—1)+B(n-1) of 
these numbers, which is more than n- 1. Hence one of the numbers 
in the upper row equals one of the numbers in the lower row. Let 
a; -n-—b,. Then n=a;+b,, i.e., n appears in A+B. 

Returning now to our objective, we have, on the basis of (3), 
for an arbitrary n: 


A, (n) » n »M(n -1) 


and therefore 
A, (n) - A, (n) » n —- 1. 


According to the lemma just proved, it follows that n appears in 
A,+A,=A,,- But n is an arbitrary natural number, and hence our 
theorem is proved. 

This simple theorem led to a series of important applications 
in the papers of L. G. Schnirelmann. For example, he was the first 
to prove that the sequence P consisting of unity and all the prime 
numbers is a: basis of the sequence of natural numbers. The se- 
quence P, it is true, has density zero, as Euler had already shown, 
so that the theorem which we just proved is not directly applicable 
to it. But Schnirelmann was able to prove that P «P has positive 
density. Hence P «P forms a basis, and therefore P indeed also. 
From this it is easy to infer that an arbitrary natural number, with 
the except ion of 1, can, for sufficiently large k, be represented as 
the sum of at most k primes. For that time (1930) this result was 
fundamental and evoked the greatest interest in the scientific 
world. At present, thanks to the remarkable work of I. M. Vinograd- 
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ov, we know considerably more in this direction, as I already relat- 
ed to you at the beginning of this chapter. 


$3 

In the preceding it was my purpose to introduce you in the 
shortest way possible to the problems of this singular and fasci- 
nating branch of number theory, whose study began with L. G. 
Schnirelmann's remarkable work. The immediate goal of the present 
chapter, however, is a specific problem in this field, and I now pro- 
ceed to its formulation. 

In the fall of 1931, upon his return from a foreign tour, L. G. 
Schnirelmann reported to us his conversations with Landau in 
Góttingen, and related among other things that in the course of 
these conversations they had discovered the following interesting 
fact: [n all the concrete examples that they were able to devise, it 
was possible to replace the inequality 


d(A+B)2>d(A)+d(B)—d(A)d(B), 


which we derived in §2, by the sharper (and simpler) inequality 
(4) d(A+B)2d(A)+d(B). 


That is, the density of the sum always turned out to be at least as 
large as the sum of the densities of the summands (under the as- 
sumption, of course, that d(4)+d(B)<i). They therefore naturally 
assumed that inequality (4) was the expression of a universal law, 
but the first attempts to prove this conjecture were unsuccessful. 
It soon became evident that if their conjecture was correct, the road 
to its proof would be quite difficult. We wish to note at this point 
that if the hypothetical inequality (4) does represent a universal 
law, then this law can be generalized immediately by induction to 
the case of an arbitrary number of summands; i.e., under the as- 
sumption that 


k 
Z d (4;) «1 
we have 


k k 
(5) 4 EA) Xd(4). 
i=] i= 
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This problem could not help but attract the attention of scholars, 
because of the simplicity and elegance of the general hypothetical 
law (4) on the one hand, and on the other because of the sharp con- 
trast between the elementary character of the problem and the dif- 
ficulty of its solution which became apparent already after the first 
attacks. I myself was fascinated by it at the time, and neglected 
all my other researches on its account. Early in 1932, after several 
months of hard work, I succeeded in proving inequality (4) for the 
most important special case, d(4) - d(B) (this case must be consid- 
ered as the most important because in the majority of concrete 
problems all the summands are the same). At the same time I also 
proved the general inequality (5) under the assumption that d(4,)- 
d(A,)=...=d(A,) (it is easy to see that this result cannot be deriv- 
ed from the preceding one simply by induction, but requires a spe- 
cial proof). The method which I used was completely elementary, 
but very complicated. I was later able to simplify the proof some- 
what. 

Be that as it may, it was but a special case. For a long time 
it seemed to me that a none too subtle improvement of my method 
should lead to a full solution of the problem, but all my efforts in 
this direction proved fruitless. 

In the meantime the publication of my work had attracted the 
attention of a wide circle of scholars in all countries to the Landau- 
Schnirelmann hypothesis. Many insignificant results were obtained, 
and a whole literature sprang up. Some authors carried over the 
problem from the domain of natural numbers to other fields. In short, 
the problem became “‘fashionable’’. Learned societies offered 
prizes for its solution. My friends in England wrote me in 1935 
that a good half of the English mathematicians had postponed 
their usual work in order to try to solve this problem. Landau, in 
his tract devoted to the latest advances in additive number theory, 
wrote that he “‘should like to urge this problem on the reader’’. But 
it proved to be obstinate, and withstood the efforts of the most able 
scholars for a whole series of years. It was not until 1942 that the 
young American mathematician Mann finally disposed of it: he found 
a complete proof of inequality (4) (and hence also of inequality 
(5)). His method is wholly elementary and is related to my work in 
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form, althongh it is based on an entirely different idea. The proof is 
long and very complicated, and Í could not bring myself to present 
it to you here. A year later, however, in 1943, Artin and Scherk pnb- 
lished a new proof of the same theorem, which rests on an altogeth- 
er different idea. It is considerably shorter and more transparent, 
though still quite elementary. This is the proof that I should like to 
tell you about; I have written this chapter on its account, and it 
forms the content of all the sncceeding sections. 


$4 
Suppose then that A and B are two sequences. We set 4+B=C. 
Let A(n), d(A), etc. have their usual meaning. We recall that all our 
sequences begin with zero, but that only the natural numbers ap- 


pearing in these sequences are considered when calculating A(n), 


B(n), C(n). We have to prove that 
(6) d(C) > d( A)  d(B) 
provided that d(4) - d(B) 1. For brevity we set d(4)- a, d(B)- B in 


what follows. 
FUNDAMENTAL LEMMA. If n is an arbitrary natural number, 
there exists an integer m (1&m&n) such that 


C(n) - C(n — m) 2 (a B) m. 


In other words, there exists a 'fremainder" (n—m+1,n) of the 
segment (1, 5»), in which the average density of the sequence C is at 
least a4. 

We are now faced with two problems: first, to prove the funda- 
mental lemma, and second, to show that ineqnality (6) follows from 
the fundamental lemma. The second of these problems is incompa- 
rably easier than the first, and we shall therefore begin with the 
second problem. 

Suppose then that the fundamental lemma has already been 
proved. This means that in a certain “‘remainder’’ (n—m+1,n) of 
the segment (1, n) the average density of the seqnence C is at least 
a+B. By the fundamental lemma, however, the segment (1,n—m) 


*remainder" (n-m-m^*1,n-—m) in which the 


again has a certain 
average density of C is at least a B. It is clear that by continuing 
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this process, the segment (1,7) is eventually divided into a finite 
number of subsegments, in each of which the average density of C 
is at least a B. Therefore the average density of C is also at 
least a * B in the whole segment (1,7). Since n was arbitrary, how- 
ever, we ba.. 


d(C)2a-B, Q.E.D. 


Thus the problem is now reduced to proving the fundamental 
lemma. We now turn to this proof, which is long and complicated. 


$5 
NORMAL SEQUENCES 


In all that follows we shall regard the number 7 as fixed, and 
all sequences which we investigate will consist of the number 0 
and certain numbers of the segment (1,n). We agree to call such a 
sequence N normal, if it possesses the following property: If the 
arbitrary numbers f and f’ of the segment (1,7) do not appear in N, 
then neither does the number {+f’—n appear in N (where the case 
f=f is not excluded). 

If the number n belongs to the sequence C, then 


C(n) - C(n - 1) -12 (a B)-1, 


so that the fundamental lemma is trivially correct (m=1). Conse- 
quently we shall assume in the sequel—I beg you to keep this in 
mind—that n does not occur in C. 

To begin with, the fundamental lemma is easy to prove in case 
the sequence C is normal. [ndeed, let us denote by m the smallest 
positive number which does not appear in C (m&n because n, by as- 
sumption, does not occur in C). Let s be an arbitrary integer lying 
between n—m and n; n—m<s<n. Then 0<s+m-—n<m. I say that 
s€C. For if this were not the case, the number s+m—n, because of 
the normality of C, would not appear in C. But we have just seen 
that this number is smaller than m, whereas m, by definition, is the 
smallest positive integer which does not occur in C. 

Hence all numbers s of the segment n—m<s<n appear in C, 


and therefore 


C(n) - Cín - m) - m -1. 
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On the other hand, by the lemma on p. 24, since m does not oc- 
cur in C=A+B we have A(m) + B(m) zxm— 1. Consequently 


(7) C(n) - C(n ^ m) > A(m) + B(m) > (a+ B)m, 
which again proves the validity of the fundamental lemma. 
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CANONICAL EXTENSIONS 


We now turn our attention to the case where the sequence 
C -A-« B is not normal. [n this case we shall add to the set B, ac- 
cording to a very definite scheme, numbers which it does not con- 
tain, and thereby pass from B to an extended set B,. The set 
A+B,=C, evidently will then be a certain extension of the set C. 
As I said before, this extension of the sets B and C (the set A re- 
mains unaltered) will be defined precisely and unambiguously; it is 
possible if and only if the set C is not normal. We shall call this 
extension a canonical extension of the sets B and C. Some impor- 
tant properties of canonical extensions will be derived, with whose 
aid the proof of the fundamental lemma will be completed. 

I now come to the definition of the canonical extension of the 
sets B and C. If C is not normal, there exist two numbers c and c^ 
in the segment (0, n), such that 


c£C, c'£C, c t*c'-n£€C. 
Since C =A +B, it follows that 
(8) ctc'-n-a*b (a€A, beB). 


Let By be the smallest number of the set B which can play the role 
of the number 5 in equation (8). [n other words, B, is the smallest 
integer be B which satisfies equation (8) for suitably chosen numbers 
cC, c'ÉC, a€A of the segment (0, n). This number By will be call- 
ed the basis of our extension. 

Thus the equation 


(9) c tc'-n*?a*f 


necessarily has solutions in the numbers c, c^, a satisfying the con- 
ditions 
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c£C, c'£C, a£A, 
where all three numbers belong to the segment (0,7). We write all 
numbers c and c which satisfy equation (9) and the enumerated con- 
ditions, to form a set C*. Clearly the sets C and C* do not have a 
single element in common. We call their union* (i.e., the totality 
of all numbers which occur either in C or in C*) 


C UC* -C, 


the canonical extension of the set C. 

Let us now examine the expression Bo n-c If c here is al- 
lowed to run through all the numbers of the set C* just constructed, 
the values of this expression form a certain set B*. According to 
equation (9), every such number B *n—c (c €C*) can be written in 
the form c/—a, where c'€C*, a €A. 

Let b* be an arbitrary number occurring in B*. Since it is of 
the form Bg *n—c, it is 28520; and since it is also of the form 
c^—a (c'€C*, a€A), it is &c'€n. Hence all numbers of the set B* 
belong to the segment (0,7). Moreover, if b* €B*, then b* £B,be- 
cause otherwise it would follow from b*-c'—a that c^-a«b* €A «B 
=C, which is false. 

Accordingly, the set B* is embedded in the segment (0,7) and 
has no elements in common with the set B. We put 


B UB* -B, 


and call the set B, a canonical extension of the set B. 
Let us show that 


A +B,=C,. 


First, let a2€A, b4€B,. We shall prove that a+ b,€C,. From 
b, €B, it follows that either b, EB or b,€B*. If b,€B, then a+ b, 
€A * B *CCC,. If b,EB*, however, then a+b, either occurs in C, 
and hence also in C,, or a+ b, ¢ C. But in this case (since b,, as an 
element of the set B*, is of the form Bo*n-c, c '£C) we obtain 


c=a+b, =at+By +n—c°€C. 


Therefore 


*Here and in the sequel we use the symbol Uto denote the union of sets, since we 
are using the symbol + in another sense. 
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c+c°^-n=a+ßB EA +B=C, 


where c £C and c’¢C. But then according to the definition of the 
set C*, 


c-acb, EC*CC,, Q.E.D. 


Thus we have shown that A+B, CC.. 

To prove the inverse relation, let us assume that c €C4, which 
means that either c £C or c EC*. If c EC, then c=a+b, a£A, bEB 
CB,. If, however, c EC*, then, for a certain a £A, the number b*= 
c—a, as we know, occurs in B*. We have c=a +b* €4+B*CA+B,. 
Therefore C,CA+B,. We also proved above that A+B,CC,. Con- 
sequently C, =A * B,. 

Now recall that according to our assumption, n £C. It is easy to 
see—and this is important—that the number n does not appear in the 
extension C,. For if we had n EC*, we could, by the definition of 
C*, put c’=n in equation (9), which would yield c-a« 89 €A+B =C, 
whereas c £C according to (9). 

If the extended sequence C, is not yet normal, then, because of 
A+B,=C, and n £C,, the sets A, By, and C, form a triple with all 
the properties of the triple 4, B, C that are necessary for a new 
canonical extension. We take a new basis Bı of this extension, de- 
fine the complementary sets B4, C* as before, put 


B,UBF=B,, CU Cf-C,, 


and are able to assert once more that 44 B5,- C, and n £C,. It is 
evident that this process can be continued until one of the exten- 
sions C, proves to be normal. Obviously this case must certainly 
take place, because in every extension we add new numbers to the 
sets B, and C, without overstepping the bounds of the segment 
(0,7). 


In this way we obtain the finite sequences of sets 


B=B CB C...CB,, 
Ca€,CC, CG 


where every B,,, (respectively C ui) contains numbers which do 


not appear in É (C) and which go to make up the set Bu (CH) so 


H 
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that 
Bus -B,U B, Cun -C,U Ch (O<spsh-l). 


We denote by By the basis of the extension which carries (B, C.) 
into (B i C a We have 


A«B,-C,, n£C, (0 £u A). 


Finally, the set C, is normal, whereas the sets C, (0€ u € h —1) are 
not. 
$7 
PROPERTIES OF THE CANONICAL EXTENSIONS 

We shall now formulate and prove in the form of three lemmas 
those properties of the canonical extensions which are needed lat- 
er. Only Lemma 3 will have further application; Lemmas 1 and 2 are 
required solely for the proof of Lemma 3. 

LEMMA 1. Bu? By (1&u &h —1); i. e., the bases of successive 
canonical extensions form a monotonically increasing sequence. 

. = * . * 

In fact, since KU pu E either Bp €B% or B, € 

By If By EB as then M is of the form 


By -By-itn-6 
where c €C7. , CC) and therefore c<n, so that B ? By and Lemma 
l is proved. If By EB ep however, then by the definition of the num- 
ber B, there exist integers a £Á, c £C, c'd C, such that 
e+c’-n=a+B,€ Cue 
But for Bu E B , we have 


(10) etc’-n=a+B,€A+B, 1-7 C, 


where c £C 2p e'f Cpi Hence, because of the minimal property 
of Pp-1° By, 2B, If Bu 7 By it would follow from (10) and the 
definition of the set Ch that 


^e C* 
u-1CĈp> c E€ Ch C Gy. 


Both are false, however, and therefore Bu? By: 
In the sequel we shall denote by m the smallest positive integer 
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which does not appear in C,. 


LEMMA 2. If ceC* (Ou &h-1) and n-m«c«n, then c» n-m« 
By: That is, all numbers c of the set C, which lie in the interval 
n—m<c<n are embedded in that part of this segment which is char- 
acterized by the inequalities n-m*B,«c«n. 

We have to show that 


c+m—n> a 

B, 

It follows from n-m<c<n that 
O<m+cec—n<m. 


Therefore, by the definition of the number m, 


m+c—n EC, 


Now 
C =C U CU Cn U... UOCE as 


We consider two cases. 
1) If me c-n eC, then 
m*c-n-acb,, ae, b, € Bp- 


t 
But m £C, and c £C, (the latter because c € C*). Therefore because 
of the minimal property of B, we must have b,2 By If b,7 B, it 
would follow from the definition of the set Ch that m EC*, which is 
false because CC C, 1 CC, and m £C,. Consequently bi^ By so 
that 


m+c-n=a+b,2b,>ßp> 


and Lemma 2 is proved. 
2) If c’=m+c-n EC} (u<sv<hk-1), then, by the definition of the 
set C* , c' satisfies an equation of the form (9), 


5 


c/-a-B,*n-c", 


where a£A4, c“’eC¥. Hence c/2c'-a»By2B, (where the last in- 
equality is given by Lemma 1), and Lemma 2 is again proved. 
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LEMMA 3. We have 
Ct (n) - Cr (n -m) = Br(m -1) (0£zu&£h—1). 


That is, the number of integers cEC* in the segment n~m<c<n is 
exactly the same as the number of integers b eB} in the segment 
O«b«m (of the same length). 


Let us examine the relation 
01) b-B,*n-c. 


By the very definition of the sets B* and Ci cEC* implies b eB*, 
and conversely. If, in addition, n—m+B,<c<n, then B, <5<m, and 
conversely. Hence 


Cho) = Cin -m+ By) = By(m -1) -BKB,- 


By Lemma 2, Cù (n-m +B )=C*(n-m). On the other hand, every 
b e B* can be expressed in the form (11), where c<n; b therefore ex- 
ceeds B, and consequently B4B) -0. It follows that 


Cr) - Cin - m) = Bim - 1), Q.E.D. 
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PROOF OF THE FUNDAMENTAL LEMMA 


It is very easy now to prove the fundamental lemma by proceed- 
ing from the results in $5 and appealing to Lemma 3 which was just 
proved. 

If we apply the result of $5 in the form of inequality (7) to the 
sequences A, B,, and C, (which is permissible because of the nor- 
mality of C,), we find that 


(12) C, (n) - C,(n-m)2 A(m) * B, (m), 


where m is the smallest positive integer which does not occur in 
C,. Obviously m £A and m £B,, so that we may write A(m-1) and 
B, (m—1) instead of A(m) and B,(m), respectively. 
We have 
C, 2CUC* UC*SU... UCT ,, 


B,-BUB*UB*U... UBt ,, 
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where the sets appearing in any one of these two unions are mutu- 
ally exclusive, so that AT 
C, (n) - C,(n -m) 2 C(n) - C(n - m) È | Cr (n) - Cr (n -m) I, 
j HM 
B (m) -B,(m - 1) 2 B(n- 1) « X Bikm—); 
p=0 

we have of course put C$ =C*, B5 - B*. On account of (12) it follows 
that 


h-1 
C(n) -Cí(n -m) € { Cin) - Cr -m) } 
p=0 
h-1 
>A(m)+B(m-1)+ X B*(m-1) 
poo " 
By Lemma 3, however, 
Cin) - Cin - m) = Bi lm - 1) (0&u &h-1), 
so that the preceding inequality becomes 


C(n) - C(n ^m) > Á(m) + B(m -1) » A(m) + B(m) z (a « B) m, 


which proves the fundamental lemma. 

As we saw in $4, this also completes the proof of Mann's theo- 
rem which solves the fundamental metric problem of additive num- 
ber theory. 


Doesn't Artin and Scherk's construction have the stamp of a 
magnificent masterpiece? Í find the outstanding combination of 
structural finesse and the extremely elementary form of the method 
especially attractive. 


CHAPTER III 


AN ELEMENTARY SOLUTION OF 
WARING'S PROBLEM 


$1 


You will recall the theorem of Lagrange, which was discussed 
at the beginning of the preceding chapter. It says that every natural 
number can be expressed as the sum of at most four squares. I also 
Showed you that this theorem could be stated in entirely different 
terms: If four sequences, each identical with 


(da) 0, 12, 22, ..., k2, ..., 


are added together, the resulting sequence contains all the natural 
numbers. Or even more briefly, the sequence (A 2) is a basis (of the 
sequence of natural numbers) of order four. I also mentioned that, as 
had been shown later, the sequence of cubes 


(A 3) 0, 18, 23, ..., 49, 


was a basis of order nine. All these facts lead in a natural manner 
to the hypothesis that, for an arbitrary natural number n, the se- 


quence 
(4,) 6, 182% a uus 


is a basis (whose order of course depends on n). This conjecture 
was also actually propounded by Waring as early as the eighteenth 
century. The problem proved to be very difficult, however, and it 
was not until the beginning of the present century that the universal 
validity of Waring’s hypothesis was demonstrated, by Hilbert (1909). 
Hilbert’s proof is not only ponderous in its formal aspect and based 
on complicated analytical theories (multiple integrals), but also 
lacks transparency in conceptual respects. The eminent French 
mathematician Poincaré wrote in his survey of Hilbert’s creative 
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scientific work, that once the basic motivations behind this proof 


were understood, arithmetical results of great importance would 
probably flow forth as from a cornucopia. Ín a certain sense he was 
right. Ten to fifteen years later, new proofs of Hilbert's theorem 
were furnished by Hardy and Littlewood in England and by I. M. 
Vinogradov in the USSR. These proofs were again analytic and for- 
mally unwieldy, but differed favorably from Hilbert's proof in their 
clarity of method and their conceptual simplicity, which left nothing 
to be desired. In fact, because of this, both methods became mighty 
scources of new arithmetical theorems. 

But when our science is concerned with such a completely ele- 
mentary problem as Waring's problem, it invariably attempts to find 
a solution which requires no concepts or methods transcending the 
the limits of elementary arithmetic. The search for such an elemen- 
tary proof of Waring's hypothesis is the third problem which I should 
like to tell you about. Such a fully elementary proof of Hilbert's 
theorem was first obtained in 1942, by the young Soviet scholar Y. 
V. Linnik. 

You are already accustomed to the fact that '*elementary'"" does 
not mean ''simple'". The elementary solution of Waring's problem 
discovered by Linnik is, as you will see, not very simple either, and 
it will take considerable effort on your part to understand and digest 
it. I shall endeavor to make this task as easy as possible for you 
through my exposition. But you must remember that in mathematics 
(as probably in any other science) the assimilation of anything real- 
ly valuable and significant involves trying labor. 

The ideas of Schnirelmann which I described to you in the begin- 
ning of the second chapter play an essential role in Linnik's proof. 
You will recall (I mentioned it at that time) how Schnirelmann 
proved his famous theorem that the sequence P consisting of zero, 
unity, and all the primes, is a basis of the sequence of natural num- 
bers: He showed that the sequence P «P has a positive density. 
This immediately yields the assertion, however, because, according 
to the general theorem of Schnirelmann which we proved on pp.24- 
25, every sequence of positive density is a basis of the sequence of 
natural numbers. The same method also lies at the basis of the proof 
of Hilbert's theorem discovered by Linnik. It all boils down to the 


39 


proof that the sum of a sufficiently large number of sequences (A,) 
is a sequence of positive density. As soon as this is accomplished, 
we can, by virtue of the same general theorem of Schnirelmann, re- 


gard Hilbert's theorem as proved. 


$2 
THE FUNDAMENTAL LEMMA 


If we add together k sequences, identical with A,, according to 
the rule in Chapter II, we evidently obtain a sequence A‘*) which 
contains zero and all those natural numbers which can be expressed 
as a sum of at most k summands of the form x", where x is an arbi- 
trary natural number. In other words, the number m belongs to the 


sequence A‘*), if the equation 
n n 
(1) XitX2te +X, = mM 


is solvable in nonnegative integers x; (1«i&k). As we saw in $1, 
the problem is to show that, for sufficiently large k, the sequence 
A‘*) has a positive density. 

For preassigned k and m, equation (1) in general can be solved 
in several different ways. In the sequel we shall denote by r,(m) the 
number of these ways, i.e., the number of systems of nonnegative 
integers x4, x5 ..., X, which satisfy equation (1). It is clear that the 
number m occurs in ACE) if and only if r, (m) 0. 

In the following, we shall assume the number n to be given and 
fixed, and shall therefore call all numbers which depend only upon 
n, constants. Such constants will be denoted by the letter c or c(n), 
where such a constant c may have different values in different parts 
of our discussion, provided merely that these values are constants. 
Perhaps you are rather unused to such “‘freedom’’ of notation, but 
you will soon become familiar with it. It has proved to be very con- 
venient, and appears more and more frequently in modern research. 

FUNDAMENTAL LEMMA. There exist a natural number k=k(n), 
depending only on n, and a constant c, such that, for an arbitrary 
natural number N, 


(2) r(m)<eN 


(1mgN). 


Once more, as in the preceding chapter, we are faced with two 
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problems: first, to prove the fundamental lemma, and second, to draw 
from the fundamental lemma the conclusion that we need, viz., that 
the sequence A‘*) has a positive density. This time again the sec- 
ond problem is considerably easier than the first, and we shall there- 
fore begin with the second problem. 

It follows immediately from the definition of the number r,(m), 
that the sum 


r (0) o r (D) +... +7,(N)=R,(N) 


represents the number of systems (x4, Xo, ..., Xg) of k nonnegative 
integers for which 


n n n 
(3) XitXoe +X, SN. 


Every group of numbers for which 
Osx, (N/k)! ^ (1«i«k), 


obviously satisfies this condition. To satisfy these inequalities, 
every x, can evidently be chosen in more than (N/k)!/" different 
ways (x,-0, 1,...,[Q/ E)! /^]).* After an arbitrary choice of this 
sort, the numbers x4,x5,..., X, may be combined, and so we have 
more than (N/k)*/" different possibilities for choosing the complete 
system of integers x, (l<i<k) so as to satisfy condition (3). This 
Shows that 


(4) RAN) 2(N/B)*/. 


We assume that the fundamental lemma has been shown to be 
correct, and that inequality (2) is satisfied for an arbitrary N. We 
now have to verify that inequality (2) is consistent with inequality 
(4) which we proved, only if the sequence A‘*) has a positive den- 
sity. The idea behind the following deduction is very simple: In the 
sum R,(N), only those summands r,(m) are different from zero, for 
which m occurs in A‘*), If 4(*) had density zero, then for large N 
the number of such summands would be relatively small; because of 
(2), however, every summand cannot be very large. Their sum R,(N), 
therefore, would also be relatively small, whereas according to (4) 
it must be rather large. 


* [a] denotes the largest integer < a, 
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It remains to carry out the calculations. Suppose that d(4(^))-0. 
Then, for an arbitrary small €» O0 and a suitably chosen ~N, 


ACXN)««N. 


Here the number N may be assumed to be arbitrarily large, because 
A‘*) (for an arbitrary k) contains the integer 1 (bear in mind Problem 
6 on p.22, which you solved). Applying the estimate (2) we get 


n N 
R,(N)= X rui) =r, (0) + 2 r,(m)< 1«c NG7/n)Y1 A CO (N)« l«ceN (5/ n) 


and hence, for sufficiently large N, 


R,()< 2ceN* ^n. 
For sufficiently small e, 
2ce« (1/k)*/", 
so that 
R(N) « (N/k)* ^^, 


which contradicts (4). Therefore we must have 
d(At*))>Q, 
But, as we already know, this proves Hilbert’s theorem. 


You see how simply it all comes out. But we still have to prove 
the fundamental lemma, and to do this we shall have to travel a long 
and difficult road, as in the preceding chapter. 


$3 


LEMMAS CONCERNING LINEAR EQUATIONS 


We shall have to go far back. It will therefore be well for you to 
forget completely for a while the problem which has been posed. I 
Shall call your attention to it when we return to it later. 

Right now, however, we have to find some estimates for the num- 
ber of solutions of systems of linear equations. The lemmas of this 
paragraph, moreover, are perhaps also of intrinsic interest, inde- 
pendent of the problem for whose solution they are required here. 


LEMMA 1. In the equation 
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(5) G4Z4 * 225 =m, 


let a4, ap m be integers with \a.|<|a.1|<A, and let a, and a, be 
relatively prime. Then the number of solutions of equation (5) sat- 
isfying the inequalities |z,|<A, |z2|<¢A, does not exceed 3A/|a,|. 
Proof: We may assume that a,>0, because otherwise we have 
merely to replace z, by — z, in every solution. 
Let 1zi, z2} and 1z1,22] be two different solutions of equation 


(5). Then from 


04,241 * 0225 7m, 
az] +aoz4=m 


we get 
as(zl—-25) =a, (z1—21) 


by subtraction. Accordingly the left-hand side of this equation must 
be divisible by a4. But* (a,,@2)=1, and consequently z2—2; must 
be divisible by a4. Now z2£z», and therefore |z2—z?|, as a multi- 
ple of a4, is not smaller than a4. Thus, for two distinct solutions 
iz 4, Zot and 1 zi, z3} of equation (5), we must have |z2- z2|2a.. 

In every solution {z,, z2} of equation (5), let us agree to call z, 
the first member and z^? the second. [t is obvious that the number of 
solutions of equation (5) which satisfy the conditions |z4|<A, 
Iza| 4, is not more than the number £ of second members which oc- 
cur in the interval «-A, A>. Since we have proved that two such sec- 
ond members are at least the distance a, apart, the difference be- 
tween the largest and smallest second members occurring in the in- 
terval <—A, A> is at least a4(£— 1). On the other hand, this difference 
does not exceed 24, so that 

a4(£—1) 24, 
(t —1) <2A/a,, 
t<(2A/a,)+1<34/a, 


(because, by assumption, a1 € Á, and therefore 1<¢A/a;). This proves 
Lemma 1. 

LEMMA 2, In the equation 
(6) G424 * 4922 * ... t ayz m, 


* (a4, G2) denotes the greatest common divisor of the integers a4 and ac. 
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let the a, and m be integers satisfying the conditions* 
la,|sA (sis), (a, 25, ..., a) - 1. 


Then the number of solutions of equation (6) satisfying the inequal- 
ities |z,| s 4 (1«i& D), does not exceed 


c() A’! /H, 


where H is the largest of the numbers |a,|, |ao|,..., |a;|, and c(J) is 
a constant depending only on l. 


Proof: If /22, Lemma 2 obviously becomes Lemma 1 (with c(2) = 
3). Accordingly Lemma 2 is already verified for /-2. We shall there- 
fore assume that /23 and that the truth of Lemma 2 has already been 
established for the case of l-1 unknowns. Since the numbering is 
unimportant, we may assume that |a,| is the largest of the numbers 
las]; laal... lal, i. e., Z 2 |ajl. 

There are two cases to consider. 

1) a, “ag 7... *a; , 70. Since (a1, a5 ..., aj) - 1, we have |a,|=H = 
l, so that the given equation is of the form +z;=m. In this equation 
each of the unknowns z,,z5,...,z;.; can obviously assume an arbi- 
trary integral value in the interval <-A,A>, and hence at most 
24 « 1&€34 values all told. As for z,, however, it can assume at most 
one value. Consequently the number of solutions of the given equa- 
tion satisfying the inequalities |z,|<A (1<i</J), does not exceed 


(34)-12 c(D) A}! 2 c()) A-1/H, 


which proves Lemma 2 for this case. 
2) If at least one of the numbers 2,22, ..., aj, is different from 
zero, then 


(a), a,, ..., aj 1) =5 
exists. Let us denote by H’ the largest of the numbers 
la/5 (gigl-)). 


Suppose now that the numbers Z, Z% ...,z, satisfy the given e- 
quation (6) and the inequalities |z;| <4 (1&i&1). We set 


(7) (a4/8) z, « (a5/ 8)z 2... (a4 1/8) z ms 


*(aj, 25, eee, 87) denotes the greatest common divisor of the integers in parentheses. 
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and hence 
QiZ4*d229t... t0, 42; 4 =m" 
Then obviously 
(8 5m’+a,z,=m 
and 


l-1 
|Bm^|s X lallzis 18 H'4, 


which implies that 
| m'A x 1H'A. 


Thus, if the numbers 2;,Z2,...,2, satisfy equation (6) and the ine- 
qualities |z,|<A (1<i</), then the integer m’ exists, which, with 
these numbers, satisfies equations (7) and (8), where [n1 x /H 'A. 
But in equation (8) clearly ôx |a;| and (6, aj) - 1 (otherwise we should 
have (a4, a2, ..., G11» aj) » 1). Hence, by Lemma 1, the number of so- 
lutions of equation (8) (in the unknowns m^ zj), for which [m'|& 
lH'A, |z| &A «IH'A, does not exceed 3/H 4/|aj|. For the same m, 
equation (7), according to Lemma 2 for equations in /—1 unknowns, 
has. at most c(/)4'?/H ' solutions in integral z; with |z,|<A. 

[t is evident, from what has been said, that the number of solu- 
tions (21, z5,..., z;] of equation (6) which satisfy the inequalities 
\z,|<A (1¢i<J), does not exceed 


(31H “A/|a,|) c(DAU2/H “= (04 1-1/]a, 
-c(D4'-1/H, 





which completes the proof of Lemma 2.* 
We shall now investigate the totality of equations of the form 


(9) Q1Z4+QoZo+...+4),2,=0, 


where |a;| «4 (1<i</J) and, as always, all a; are integers. Let B be a 
positive number whose relation to the number A is described by the in- 
equalities 1<A<¢B<c(l)A'!, and let /»2. We now want to estimate 


*You have probably noticed that in the last chain of equations the symbol c(l) oc- 
curred in different places with different meanings. On p.39 I prepared you for such a use 
of this symbol. 
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the sum of the numbers of solutions z;, |z;|<B (1&i&/) of all the e- 
quations (9) of this family. 

1? First let us make a separate examination of equation (9) for 
4,-a5-...-a,-Ü0 (it is a member of our family) and estimate the 
number of its solutions which satisfy the inequalities |z;| «B (1< 
ixl). Our equation is obviously satisfied by an arbitrary system of 
numbers 21, Z9,..-,2,, and we have merely to calculate how many 
such systems exist which satisfy the inequalities |z,|<8,|z2|<8, 
..-y|Z,|$B8. Since the interval <-B,+B> contains at most 2B +1 in- 
tegers, each z, can assume at most 2B +1 different values. Conse- 
quently the number of systems Íz;, Z2, ..., zı} of the type in which 
we are interested does not exceed (2B + 1)! «(3B8)! - c()B!. By our hy- 
pothesis, however, B &c(I)4'-1, so that c())B! 2 c())8-1B «c()(A B)'-1 
Hence, for the case where a,-a5-...-a,-0, equation (9) has at 
most c(/)(AB)!-! solutions of the type we are interested in. 

2° Even if only one of the coefficients a, is different from zero, 
the greatest common divisor of these coefficients, (a, a», ..., aj) -ó, 
exists. Suppose first that 6-1, and let H be the largest of the num- 
bers |a;| (¢=1, 2, ..., l). Clearly H is one of the integers in theinter- 
val «1, 4». Hence, H is either between 4 and 4/2, or between 4/2 
and 4/4, or between 4/4 and 4/8, etc. It is therefore possible to 
find an integer m20 such that 


(10) A/2nm*lcH«A/2n., 


According to Lemma 2, for an equation of the form (9) in which 
ó-1l and H satisfies the inequalities (10), the number of solutions 
z;, |z;|€ B, does not exceed 


c()B*-1/H «c()B8!-1/(4/20*1) 2 c()8!-12n 4. 
On the other hand, it follows from (10) that 
(11) ja s 4/27 (1&i«1). 


Consequently the number of equations of type (9) for which the ine- 
qualities (10) are satisfied is at most equal to the number of equa- 
tions of the same type which satisfy the conditions (11), i. e.,atmost 


(2(4/27) + 1) z(34/2n)! = c(])4!27!, 


Thus the sum of the numbers of solutions |z;|<B of all such e- 
quations of type (9) for which 8-1 and 42: +D) «H « A27", doesn't 


exceed 
(c(D)B'-12/4). e(D4!27n! = c(D (AB) -12- 0-1), 


Summing this estimate over all m20, we reach the following con- 
clusion: The sum of the numbers of solutions |z;|<B of all equa- 
tions (9) for which |a;| <4 (1&:€7/) and S=1 is at most 


e(l) (AB). 


3? [t remains for us to figure out the numbers of solutions of the 
required type for equations with 5>1. In this case equation (9) is 
evidently synonymous with the equation 


(a4/8)z , * (a2/8)z o+...+(a,/8)z =0, 
where only 
(a4/8,a 5/8, ...,a,/5) -1 


and the number A has to be replaced by the number 4/8. As we saw 
in 2°, the sum of the numbers of solutions |z;| <B of all such equa- 
tions, for a given, fixed 5, does not exceed* 


c(l) (4571 B)'-1 = c(D(AB)'-18- 0-1, 


Clearly now we have merely to sum this expression over all the pos- 
sible values of 5 (1<¢5<A). 

Thus we find that the sum of the numbers of required solutions 
of all equations of the form (9), where |a;|<4 (1si¢/J) and not all 
a; are equal to zero, does not exceed the value 


A 
«D (AB) X3-0-D« el (ABY-1. 1-1 e (DA BY--1, 
zl zn 


[To obtain the first relation we employ the inequality 


A 
Z(1/n1*!)«(q-1)/3, 


n=] 


*Since instead of A we now have to take the smaller number A/6, it is conceiv- 
able that the assumed condition B&c(l)A"™ is violated. You can verify, however, 
without any trouble, that we made no use of this assumption in Case 29, and that the 


result in 2? therefore does not depend on it. 
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which is valid for an arbitrary natural number q and for an arbitrary 
Á2.1 (we denote by g the number l-2, which is positive because we 
assumed that />2). Here is a simple proof: For n21 we have 


nI - (n 1»? -Í(n- 1)? n2] / n?(n - 1)? 
(nt 4 qnt-1&...- 1—n9) / n2(n 4 1)2 
> qn4-1/n?(n +1) » q/(n4 1)0*, 


and hence 


(n 1) (€*1) « g-14n- (n 1) 4]. 


By substituting successively n21,2,..., A — 1l in this inequality and 
adding all the resulting inequalities together we find that 


A 
Ern de q 1(1—4A -49)«1 / q, 
which implies that 


A 
Xa 1.0/9) «(g^ 1)/4, Q.E.D.] 
a= 
Comparing this with the result in 1°, where we obtained an esti- 
mate for the case a; =@2=...=a,=0, we reach the following conclu- 
sion: 
LEMMA 3. Let l>2 and 1&A€Bc(D)A'*-1. Then the sum of the 
numbers of solutions |z,|<B (1«ix/) of all equations of the form 


(9) Q12,'G222*...ta;2;-O, 
where |a;|<A (1&i&I), does not exceed 
c(XAB)'-, 
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TWO MORE LEMMAS 


Before proceeding to prove the fundamental lemma, we have to 
derive two more lemmas of a special type. They are both very simple, 
in idea as well as in form, and yet their assimilation might cause 
you some difficulty because they are concerned with the enumeration 
of all possible combinations, whose construction is rather involved. 
The difficulty with such an abstract combinatorial problem is that it 
is hard to put it in mathematical symbols: one has to express more in 
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words than in signs. This is of course a difficulty of presentation, 
however, and not of the subject itself, and I shall take pains to out- 
line all questions that arise, and their solution, as concretely as 
possible. 

We shall denote by A a finite complex (i. e., collection) of num- 
bers, not all of which are necessarily distinct. If the number a oc- 
curs A times in the complex 4, we shall say that its multiplicity is 
A. Let 24,, 05, ...,G, be the distinct numbers which appear in A, and 
let A4, Ào, ..., À, be their respective multiplicities (because the com- 


plex A contains all together 2, numbers). Let B be another com- 
i= 


plex of the same type, which consists of the distinct numbers b, , 
b, ..., b, with the respective multiplicities p1, 42, ees foe 
Let us investigate the equation 


(12) X*y-c, 


where c is a given number and x and y are unknowns. We are inter- 
ested in such solutions Íx, y] of this equation in which x is one of 
the numbers of the complex A (abbreviated x € 4) and y is one of the 
numbers of the complex B (y €B). If the numbers x=a, and y=), sat- 
isfy equation (12), this yields A;u, solutions of the required kind, 
because any one of the À; "specimens" of the number a;, which oc- 
cur in the complex 4, can be combined with an arbitrary one of the 
uy specimens of the number b, appearing in the complex B. But we 
have* Au, &4(A2* ug). Therefore the number of such solutions of 
quation (12), where x=a,, y-b,, is not greater than (A2 « u2).It 
follows that the number of all solutions x €A, y €B of equation (12) 
is not more than the sum XA? +p2). Here the summation is over all 
pairs of indices Íi, k} for which G,* b, 7c. Our sum is enlarged if we 
sum A? over all ¢ and p over all k (because every b, can be com- 
bined with at most one a;.) It finally follows, therefore, that the num- 
ber of solutions x £A, y EB of equation (12) does not exceed the 
number 


a( ZA + Eg). 


***The geometric mean is not greater than the arithmetic mean’’. Here is the sim- 


plest proof: 
0<A,-p,)?=A? tuz-2Ag and hence QA py eu. 
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On the other hand, let us consider the equation 
(13) x-y-0 


and calculate the number of its solutions x €A, y €A. Clearly every 
such solution is of the form x=y=a; (1&i&r). For a given i we ob- 
tain A2 solutions, because the numbers x and y can coincide, inde- 
pendently of one another, with any one of the A, specimens of the 
number a; appearing in A. Accordingly the total number of solutions 
x€A, y £Á of equation (13) is equal to ZAP In exactly the same 
i= 
way we find, of course, that the number of solutions x €B, y €B of 
s 
the same equation is equal to 2 pi. If we compare these results with 
the one found above, we reach the following conclusion: 
LEMMA 4. The number of solutions of the equation 
xt+y=c, x€A,yeB 


does not exceed half the sum of the numbers of solutions of the e- 
quations 


x-y-0, xed, yeAd 
and 
x-y-0, xeB, yeB. 


For the special case in which the complexes A and B coincide 


we obtain the following 


COROLLARY. The number of solutions of the equation 
x+y=c, x€A, yed 
does not exceed the number of solutions of the equation 


x-y=0, xtA, yeA. 


Now let k and s be two arbitrary natural numbers. We put 4-25-1, 
and investigate the equation 


X44+Xo+ [237 *Xj-C. 


Let 41,45, ..., Àj be finite complexes of numbers. Suppose that 


the complex A; (1x/&/) consists of the distinct numbers aij 0:55 95 
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with the respective multiplicities À,,,A,,,.... We are interested in 
the number of solutions of the equation 


(14) XitXo*.. X20, X; EÁ, (1&ixl). 
If we set 
X1tXot. *X)j$27* X(1/221* *** **1 =y 


(}/2 is of course an integer), then the given equation can be written 
in the form 


x+y=C, 


and Lemma 4, which we have just proved, can be applied to it. We 
have only to find out to which complexes the numbers x and y be- 
long. Since x; €A, (1<i<l), x can be an arbitrary number of the form 
Zi*Z2*t..t2,/5, Where z; £A; (1 &i&I/2). Similarly y can be an ar- 
bitrary number of the same form, where, however, z; £A 2i 
(1gisl/2). 

Hence, by Lemma 4, the number of solutions of equation (14) 
does not exceed half the sum of the numbers of solutions of the 
equation 
(15) x-y-0 
under the following two hypotheses: 

1) X-z4t22t..*Z|p/ 

yezitzóte tip 
where 
(16) z,€£A,, z;EA, (1lgi<l/2); 

2) x and y have the same form, but 
(17) ZEA Uap Zi EA Sts l/2). 

In both cases equation (15) may be rewritten in the form 


We conclude therefore that the number of solutions of equation (14) 
does not exceed half the sum of the numbers of solutions of equation 


5l 


(18) under the hypotheses (16) and (17), i.e., it does not exceed 
half the sum of the numbers of solutions of the equations 


1/2 
(18a) 2 Mz -2,)=0, z EÁ; 264. (10xi«1/2) 


Li 


and 
1/2 
(18 b) 2 (z;-z/)=0, z EÅ aay ziEÁQ/2)4i (10«i«1/2). 


Equation (18) has //2 summands on the left-hand side, i. e., half 
as many as the original equation (14). 
We set 
1/4 1/2 


E(u-iD-» ( -z =y, 


X Zi 
i=(l/4)+1 
and thereby bring equation (18) into the form 
x+y=0. 


To this we can apply Lemma 4 anew. It is evident that, just as we 
arrived at equation (18) from equation (14), we now get from equation 
(18) to the equation 


1/4 
(19) Xu *ui-— ui uit) =0, 


where we have to consider the sum of the numbers of solutions of 
this equation under the following (now four) hypotheses: 
1) uj, up uj, uj EA, 
2) up uj uss uj" EA agyi 
1 . 
3) uj, uj uy, u” €A Qsisl/4) 


(1/2)-i 
» 5» 555 
4) uj, uy ui, uj" EA aiji’ 


Since /=k-25, we can repeat this process s times. We evidently end 
up then with the equation 


k 
-l -1 s 
(20) Lys yla ey Vay (2° +0)_,,,-y(2")}=0, 


where we have to consider the sum of the numbers of solutions of 
this equation under 2* different hypotheses, viz.: 
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D Pedy yP eA n sy] 64, 
2) YEA, Lp WEA, pop T A 


(1<js25) 
25) AP EAL s papers YL CA, OS. 
If we put 
YDD ey) 4040) (1<j<2°), 
then equation (20) takes on the simple form 
(21) yey ees 5. t DL c 025 9. 


Here we are concerned with the sum of the numbers of solutions of 
equation (21) under the following 2? hypotheses, which differ from 
one another in the value of the parameter w (O<w< 25 — 1): 


where 
y) £4, , +1? y CA ke sees yf?) EA (41 )k (j=1, 2, ...,2*). 
Thus we can express the final result of our deduction in the form 


of the following proposition: 


LEMMA 5. If 12 &-2*, the number of solutions of the equation 
(14) XQtX2teotxj20, x EÅ; (sisi) 
does not exceed the sum of the numbers of solutions of the equation 


-1 -1 
(21) ye y GO uuu ey (2? 7L y(*7 D | (2°) 20, 


(j= 1,2, ..., 2°) 


y? EA kl yd? €A wees y, P? £A 


wkt2?* (w+1 )k 


under the hypotheses w=0, 1,...,2* — 1. 
Notice the connection between Lemma 4 and Lemma 5 for 
k-2s-1, l=2. 


This winds up our preliminaries, and we are ready now to begin 
the direct assault on the fundamental lemma. 
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PROOF OF THE FUNDAMENTAL LEMMA 


We are going to prove the fundamental lemma by the method of 
induction on n. [t is often the case in inductive proofs, that a 
strengthening of the proposition to be proved, considerably facili- 
tates its proof by the given method (and sometimes is actually what 
makes the proof feasible in the first place). The reason for this is 
easy to understand. In inductive proofs, the proposition is assumed 
to be correct for the number n—1, and is proved for the number n. 
Hence, the stronger the proposition, the more that is given to us by 
the case n—1; of course, so much the more has to be proved for the 
number n, but in many problems the first consideration turns out to 
be more important than the second. 

And so it is, in fact, in the present case. Of immediate interest 
for us is the number of solutions of the equation x44 x5... «xp —m 
(lgmsN) (where, according to the very meaning of the problem, 
Ocx;&ml/^&N1/n), But x” is the simplest special case of an n-th 
degree polynomial 


— n -1 
fæ) - agx" taat +a, \x+a,, 


and it will be to our advantage to replace the given equation (1) by 
the more general equation 


(22) flxs)+ flxa)+...+ flx,) =m, 


where the unknowns are subjected to the weaker conditions |x,|< 
N!/^ (1<i¢k). The proof of our proposition for equation (22) will 
give us more than we really need; but, as you will see, it is just 
this strengthening of our proposition which creates the possibility 
of an induction. And so, for m &N, let us denote by r,(m) the number 
of solutions of equation (22) which satisfy the conditions |x;| «N17 
(1«i& k). Of course we are still free to dispose arbitrarily of the co- 
efficients of the polynomial f(x) in the interest of the induction to be 
performed (provided only that the imposed conditions are satisfied 
in the case f(x)-x"). We are going to prove the following proposition: 


Let the coefficients of the polynomial f(x) satisfy the inequalities 
(23) la] &c()N?"^  (Ogign). 
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Then, for a suitably chosen k=k(n), 
ru (m) « c(n) NG/» (1£mgN). 


Since the inequalities (23) are obviously satisfied in the case 
f(x) » x^ for c(n)=1, this theorem is indeed a sharpening of our fup- 
damental lemma. 


Let us first consider the case n=l, f(x) 2 aox * a4. We set k(1)=2, 
so that equation (22) acquires the form 


Go(xi*x2)-2m-2a,. 


We are interested in solutions of this equation which satisfy the re- 
quirements |x,| £N, |x| <N. Thus at most 2N+1<3N values are pos- 
sible -for x4. But at most one x5 corresponds to every x1, so that 


r g(m)<3N , 


which completes the proof of our proposition for n=1 (k- 2). 
Now let n>l, and suppose that our assertion has already been 
verified for the exponent n—1. Put k(n—-1) - k^and choose 


k - k(n) -2n-2[4 1082k 1, 


where the exponent means the greatest integer not exceeding 4log 2k.’ 
In the sequel we shall set [4loggk/]-1-s, for brevity, so that 


(24) k-22n.25*!., 


To estimate the number, ry (m), of solutions of equation (22), we 
first apply Lemma 4 to it, setting 


Kk k 
x= 2 f(x;), y= eal 0 


The complex A (and the complex B which coincides with it in 
this case) consists of all sums of the form. 


Kk 
Xf), where Ix;| N77 (l<ig kh). 


By the Corollary of Lemma 4, r, (m) does not exceed the number of 
solutions of the equation x—-y -0, where x EA, y EÁ, i.e., 
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k/2 k /2 
x -àEfG y= Xf(yj, 


Ix|sN!^^, |y,| s N17" (1<isk/2). 


In other words, r,(m) does not exceed the number of solutions of the 
equation 


k/2 
(25) X27 fir 21-0, 


where |x,| «V! /^, ly &N!/^ (10xixk/2). We now set x,-y,=h, 


t 


(1gigk/2) and replace the system of unknowns Íx;, y;! by the sys- 
tem ly,,h,}; here we allow y; and h; (1<gi¢k/2) to assume all pos- 
sible integral values in the interval «-2N!/^, « 2N1/^», which can 
only increase the number of solutions of our equation. This means 
that every summand f(x;)-f(y;) in equation (25) is replaced by the 
expression 

n-l 

fy, «h9- f(y)- Za ly, +h)” -yr'"l 
n-1 n-v 
Ee Eee. 
If we change the variable ¢ of summation by putting 
v+t=u, 

so that 


n-v-t=n-u, t=u-v, 


we obtain 
n-l n 
fo, «h)-fo)-h, Xa, X Q-Qhpriyr* 


n u-l 
=h, X yr" Za, (We) Aer} 


1 L v=0 v u-—v 


-h, È a; yp "= ho), 


ucl 


where 


b= X a, y" 
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is a polynomial of degree n -l with coefficients 


t, 


u-l 
a, ,7 X a, (-0)gs-v-l- (1gigk/2) 


which depend on the numbers 7. 
Thus, in the new variables ly; À,], equation (25) assumes the 
form 


(26) h iy *hobs(y2) o... hu phy plyy,) =0. 


In this equation the numbers h, and y; may take on arbitrary in- 
tegral values in the interval «—2N1/^, 4 2N1/^5, where we must bear 
in mind that the coefficients of the polynomials ¢,(y) (of degree n- 1) 
depend on the numbers A. 

Mark well that we have proved the following so far: The number 
ry (m) which we are estimating, does not exceed the sum of the num- 
bers of solutions in integers y; |y,|<2N1/" (13i k/2), of all the 
equations (26) which can be obtained from all possible values of the 
numbers h, |h,| £2N! /" (1&i& &/2). 
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CON TINUATION 


We are now going to examine one of the equations (26), i.e., we 
shall regard the numbers 7, (1<i¢k/2) for a while as fixed. Let us 
apply Lemma 5 to this equation; the numbers /,¢,(y;) play the role 
of the unknowns x,, the number Vk - 2n-25 plays the role of the num- 
ber l, and we set 2n - ko for brevity. Recall once more that the num- 
bers k; appear in equation (26) not only explicitly but also through 
the coefficients of the polynomials ply). The complex A, to which 
the numbers x,=.¢,(y,) must belong consists, in the present case, 
of all numbers of the form 4,¢,(y,), where the numbers ^; have given, 
fixed values and the numbers y; run through the interval «—2N!/^, 
41 2N1/n5, 

According to Lemma 5, the number of solutions of equation (26) 
satisfying the requirements just described, does not exceed the sum 


of the numbers of solutions of the equation 
-l -1 
(21) yay Gu yT )-,Q* MD. ny (2720 


under the following 25 hypotheses which correspond to the values 
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of the parameter w -0, 1, ..., 25 - 1: 


0) .V (7) G) (7) 
y —yi tyo Fow +Yk > 
3 (1<j<25), 


ye (1«i € ko) 


wkoti 


where, remember, A, (1&£r&25*) is the complex of numbers of the 
form ^, $, (x) with prescribed A, and arbitrary y, |y,| « 2N1/n, 

For the case w=0 (which we choose merely as an example), 
equation (21) in expanded form looks as follows: 


byt yd +... 0 


1d yd usse yl 
T... © . o o 


s-l s-l s-l 
-dy(27 en gy f2 Dga, et o 
- iy 4442, ... «x2 =0, 
or, rearranging the summands, 
-1 -1 s 
iy Dey (OL cy Q7 )-yQ* "HL. zy] 


-1 -1 
" yf ey 24 en yA yg?" HD.. ay f2°)} 
oe le se vl Sy 4 


s-1 s-1 s 
* yc XQ ee XO hey +1) =. X ) |20; 


every one of the numbers y? is a number of the form hb (VU), 
where |vf?| «2N!/^. Hence the last equation can be rewritten in the 
form 


h iid (v if 1 ) +h loeb (277 )) -é(o 25 n )-,..— o N 
& hdd Ku D) e. d ood 2) e. thy d, (yO) A (4?) -0. 
By putting, for brevity, 
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$y) AC CUPS AO C -1 )- 425751 ) =... h; (ul ) =z, 
(1&i ko) 


this equation can be written quite shortly as follows: 


All together we have 2° equations of this sort, and their totality can 
be written down in the compact form 
ko 


LA 


ic] “koti Zw kg =O (O<w 25-1). 


For the present, however, we shall confine our investigation to e- 
quation (27), which may of course be regarded as typical. To esti- 
mate the number of solutions of this equation which interest us, we 
must first see within what limits the quantity (wu?) can vary. To 
this end we recall that (p. 55) 


AO) Xa,y'", 
where 


u-l 
au Xa Qe (1<igk/2). 


i. 
Hence it follows from our hypotheses |a, |«c (n)N"/^ and |A|«2N that 


u-1l i 
la; ul < EN Gp e(n)N v] )/n =e Gy ecDs E ey) l 


i. e, in view of u&n, 
(28) |a; ,| «c(n) N17, 


On the other hand, because of lv «2N1/^, we have |v?|^** xe(n) . 


NO-w)/n and consequently 


la; ,l-Iof |n e(n) NCi7 Dn Neu) nze(g)N 1), 


The same estimate (with another c(n)) holds for the whole $ uf), 
since the number of terms of this polynomial is equal to n. Accord- 
ingly 

lp GP) «c()N 717^ (Q sisk,s1sj«25). 
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But every z; is the sum of 2°=c(n) summands of the form £$(vU?), 
and therefore 


Iz,| «c(n)N 717^ — (1&igkg 


(with another c(n), naturally). This means that in equation (27) every 
z; can assume only the values lying in the interval «—c(n)N(n-1/n, 
4 c(n) N(n-1)/n5, 

Let m be one of these numbers. The equation z , -m can be sat- 
isfied in general not only in one but in several ways, because the 
definition of the number z; (p. 58) is such that one and the same val- 
ue of z; can very well result from different choices of the numbers 
vii) (1&j«2*). We now have to estimate the number of solutions of 
the relation z; 7m, i. e., of the equation 


(29) d$,Gf!)).... + o2 )- $2751 ) -..- d Gf" ) «m. 


For this purpose we shall finally have to apply the long-promised 
induction. We proceed as follows. 
First we rewrite equation (29) in the form 


$ (1) « 6, ((2)) e... «o (ol ?) 
-m-d$Gf 1) — 4h (277 90) re (y (2), 


This is possible because for k’=k(n—1)>1 (and we have seen that 
already &(1)-2) we have 


psa "t log ok’ ]-2 i 
(In detail: k’22, logok 21, 3logek 23, [4logak’1-2>4 loge k "-32 


logo", 9s-1 L9 logo k Mk ^) ' 
If we denote the right-hand side of the last equation by m 5, we 
get 


(30) $ (vl) e... (vf) =m’, 
Let us choose some particular values for the numbers 


(in the interval <-2N1/", +2N1/7>, naturally); then m also acquires 
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a definite value. To equation (30) we now apply the theorem to be 
proved, since ¢,(y) is a polynomial of degree n—1. We have to ver- 
ify that all the necessary hypotheses are fulfilled. We have 


fly) = a, 235-75 


where, according to (28), 


(31) la, „1< eln) N 9 = em) DH, 
and, as is easily seen, 

Im "| ce) N 
(because 7 and all $.,(//)) satisfy this inequality). 


In virtue of the last inequality, the role of N can be assumed by 
the number c(n) V^-3)/^; then the conditions (31), which the coef- 
ficients of the polynomial ¢,(y) satisfy, are precisely the conditions 
(23) with n replaced by n—1. Thus all the hypotheses are indeed 
fulfilled, and we can assert that the number of solutions of equation 
(30), for which [ut] « 2N1/n= 2(N(n-1)/n)1 /(-1) does not exceed the 
number 


^ 


n-l k k'-n41 


(32) c(n(N ^ yi !ze(qN 7" 








This estimate is obtained for the fixed values «* TI) v(?*), 
Clearly we have at most 
S Ld 


(33) (4 N1/n 11)25-k'ec()N  — 


such systems of values. The total number of solutions of the re- 
quired type, of equation (29), therefore does not exceed the product 
of the right-hand sides of (32) and (33), i. e., it is at most 


25-n41 


(34) c(n)N ^ 





We now return to equation (27). We saw before (p. 59) that every 
z, can assume only the values lying in the interval «— c(n)N - 1)/n, 
+ c(n)N (-1)/^5, Now we see that the ‘‘multiplicity 


" of each of these 


values (i.e., the number of ways of choosing the yfi) so as to sat- 
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isfy the equation) does not exceed the number (34). 

This result makes it possible to reduce the whole problem to an 
estimation of the numbers of solutions of linear equations. For at 
the end of $5 we reduced the estimation of ry (pm) to the estimation 
of the numbers of solutions of equations of the form (26). But as we 
proved by an application of Lemma 5, the number of solutions of 
equation (26), for which ly, «2N! 7, is at most equal to the sum of 
the numbers of solutions of 2° equations of type (27), i. e., already 
linear equations. In this connection we obtained limits within which 
the unknowns 2; are allowed to vary. À certain new difficulty (the 
price we have had to pay for the transition to linear equations) is 
that the new unknowns z, have to be considered with certain multi- 
plicities (for which we have also determined limits). 

Finally we must not forget that all these calculations are made 
under the assumption that the numbers À, are chosen and fixed. 
Therefore we still have to multiply the result obtained, by the num- 
ber of all such possible choices. 

The final result of this section, which we have to keep in mind, 
reads: Our estimated number r,(m) does not exceed the sum of the 
numbers of solutions in integers z,, |z;| & c(n)Nn-1/n, with multi- 
plicities A, & c(n)NG* -n*1)/n, of equations of the form 

ko 
(35) È k 


a) wkoti who +i =O: 


where w runs through the values 0,1,...,25—1, and the numbers h, 
(1&rz2*ko) assume, independently of one another, all integers in 
the interval <—2N1/", 4 2N1/n5, 

And so we see that we have now obtained an estimate for r,(m), 
in whose formulation the given polynomial f(x) does not appear, 
which lends this estimate a very general character. 
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CONCLUSION 


Now that we have reduced the problem to an estimation of the 
number of solutions of linear equations which are independent of the 
special form of the polynomial f(x), we quickly reach our goal with 
the aid of Lemma 3. 

Denote by K any particular combination of the numbers À, 
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|h; «2V1/» (1gigk/2), and by U,(K) the number of solutions of 
equation (35) for this fixed combination K and for a certain pre- 
scribed w, where we are concerned with those solutions z, which 
satisfy the inequalities |z,|<eln)N@-1)/n, with multiplicities A,< 
c(n)N(2? -n*1)/n. Then, according to the final result in the preceding 


section, 
29-1 
r,(m)< xix U (K) P 
K w=0 


where the summation over K extends over all admissible combina- 
tions K of the numbers À;. This can be written 


2-1 
rim) X (XU, (OH. 


It is immediately evident, however, that for different w the sums 
2 U,{K) do not differ from one another at all (because for different 


w the equations (35) do not differ from one another in any respect). 


We can therefore write 


ry (m) ¢ 25 2 Uo(K) - c (n) 2 Uo(K). 


Here Uo(K) is the number of solutions of the equation 

(36) hazı thazgt.e thy, z,,70 

for the given combination K of the numbers À,, |5^,| «2N! /n (1xi«k/2), 
where |z,| € c()NG- 17» and the z; have multiplicities A,<¢c(n)- 
N(2°-n+1)/n, Let us denote by US(K) the number of solutions of the 
same equation under the assumption that all z; are simple. Then 
clearly 


23-n41 
US(K)&ic() N ^ KoUS(K), 





or, recalling that ko=2n, 

Us (K) € c(n) N2 (7? -n* U&(K), 
and hence 
(37) r, (m)<e(n)N2(2°-n+1) I U&(K). 


Now let us note the following. Every K represents a certain admis- 
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sible combination of the values of all à; (l<gighk/2); the number 
U&(K), however, is completely determined by the values of the first 
ko-2n of these values (1¢i<2n), because they alone appear in e- 
quation (36). Of course when we choose a certain fixed combination 
K, we thereby also uniquely define a certain combination K’ of the 
values hi,ho,...,45,. But if, conversely, a certain combination K’ 
of the numbers hisha. hon is selected, there corresponds to it 
not the single combination K, but rather as many as there are ways 
of choosing the remaining "supplements" h; (2n<i<k/2). Since 
every h, must belong to the interval «—-2N1/^, 4 2N1/n5, it is evi- 
dent that to a combination K ' there correspond at most 


c(n) (N1/n)(/2)-2n —. e(n} Ns / 20)-2 
combinations K. Hence 


I U&(K) x c(n) N (7 20)-2 z US{K’), 


where U*(K^ is the number of solutions in integers z,, |z,| ¢c(n)- 
N(n-1)/n (1«i«2n) of equation (36) for the given combination K ' of 
the numbers h,, jk; | s2 N17» (1gi<2n), and the summation is to be 
extended over all such combinations. From (37) we therefore obtain* 


(38) nm) s c(m)N? Gn D NGI 20-2 USK") = eG)? (P0 UB (KO. 


Finally, UK’) is immediately estimated with the help of 
Lemma 3, where we have to put /=2n, A=2N!/", B=c(n)N&-1)/n, 
you can easily verify that al] the hypotheses of Lemma 3 are satis- 
fied. On applying this lemma we find that 


2, USK”) < cln) (AB)?-1 = c(n)N?^-, 


At last, inequality (38) yields 
-1 


stl, 


k 
r,(m)< c(n)N2(25 * 1-5), y 25-1 2 6(n)\N2°25 1-1, o(n)N 


which completes the proof of the fundamental lemma and thereby 
also of Hilbert's theorem. 
This proof, so exquisitely elementary, will undoubtedly seem 
very complicated to you. But it will take you only two to three 
*Recall that k=2n-25 * !, 
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weeks' work with pencil and paper to understand and digest it com- 
pletely. It is by conquering difficulties of just this sort, that the 
mathematician grows and develops. 


Xx X X 


